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ABSTRACT 


Efficient algorithms for computing a centerpoint have important applications in Com- 
putational Geometry. We explore these applications in this thesis. In particulai, we discuss 
a randomised algorithm which uses the notion of a centerpoint, for finding a small graph 
separator of an overlap graph of a neighborhood system. Next we present an optimal paral- 
lel algorithm for computing a centerpoint of a finite planar point set in the exclusive read, 
exclusive write parallel random access machine model. The algorithm uses niog* ' n 
cessors and takes O(log^ n log* n) time. Finally we extend the notion of a centerpoint to 
the weighted case. We show that a weighted centerpoint always exists for any configura- 
tion of points with arbitrary weight assignments. We then present a linear time algorithm 
for computing a weighted centerpoint of a configuration of points that form the vertices 
of a convex polygon. In addition, we give an O(nlog® n) algorithm for finding a weighted 
centerpoint of a general configuration of points. 
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Chapter 1 


Introduction 


1.1 The centerpoint problem 

The center of every centrally symmetric figure has the property that every chord through 
it bisects the area and the bounding curve of the figure; the chord itself is bisected at the 
center. Clearly there are planar figures having no center of symmetry - a point which bisects 
every chord passing through it. 

In daily life, we often use phrases like ”the very center of a city”, even when a city has 
no center in any exact sense (as above). We all have an intuitive understanding as to what 
these phrases mean. The notion of the center of a finite set of points captures this intuition 
in a quantitative way for point sets. Similar notions of center exist for bounded planar 
curves, bounded planar figures et cetera [YB61]. 

Let «S be a set of n points in d-dimensional Euclidean space. 

Definition 1.1 A point x in d-space, not necessarily in S, is a centerpoint of S if every 
closed half-space including x contains at least points of S. 

Definition 1.2 The center ofS is the collection of all centerpoints. 

Using the above definition, we can deduce an alternate characterisation of the center as 
the intersection of all closed half-spaces, each containing more than points of S. 

The center is always nonempty for any configuration of points. This can be proved using 
the above alternate characterisation and Helly’s theorem. We have the following theorem : 

Theorem 1.1 Every finite set of points in d-dimensional Euclidean space admits a center- 
point. 
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The computation of a centerpoint of a finite set of points in two or higher dimensions 
is of fundamental importance in geometric algorithms that require a balanced partitioning 
of the input point set. We give a brief survey of the previous work done regarding the 
computation of a centerpoint of a finite point set in two and three dimensions : 

1. Using the powerful technique of Parametric Searching [MegSSa], Cole et a/ have given 
an 0(nlog®n) algorithm for computing a centerpoint (CSY87] in two dimensions. 
Subsequently Cole improved it to O(nlog^ n) by using a refinement of the Parametric 
Searching technique [Col87]. The algorithm has been extended to three dimensions 
by Naor and Sharir [NS90] with a time complexity of 0(n^log®n). 

2. Matousek proposed an 0(n log'* n) algorithm for computing the whole center of a set 
of n points in the plane [Mat92]. In addition, he gave a linear time algorithm for 
finding an approximate centerpoint, as close to the exact one as we want in any fixed 
dimension [Mat91]. 

3. Very recently, Jadhav and Mukhopadhyay gave a linear time prune-and-search algo- 
rithm for computing a centerpoint in two dimensions [JM93]. Their algorithm uses 
another important notion in Computational Geometry - that of a ham-sandwich cut. 

1.2 The ham-sandwich cut problem 

We first define the notion of a bisecting hyperplane. Let P be a set of n points in E^. 

Definition 1.3 hyperplane h is said to bisect V if neither of the two open half-spaces 
defined by h contain more than j points ofP. 

Definition 1.4 A ham-sandwich cut is a hyperplane that simultaneously bisects d point 
sets in E^. 

The ham-sandwich cut theorem guarantees the existence of such a cut. 

Theorem 1.2 Let Pj, P 2 , . . . , Pj be d finite sets of points in E^. There exists a hyperplane 
h that simultaneously bisects Pi, Pj, . . . , Pj. 

In two dimensions, a ham-sandwich cut is a line that simultaneously bisects both the 
given point sets. The generalised ham-sandwich cut problem asks for a line that splits the 
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two sots in R|>('cifio(l ratios. Wlirn the point sets arc linearly separated, it is always possible 
to find such a line. 

We can use the standard transformation [Ede87] which maps points to lines and then 
look at the problem in the dual plane (See appendix A). If the given two point sets are 
linearly separated, then in the dual plane, we have two sets of lines: one having lines with 
positive slopes and other having lines with negative slopes. The ham-sandwich cut in the 
primal piano corresponds to the point where the median levels intersect in the dual plane. 
Any level of the first set is a monotonically increasing function and that of the second set 
is a monotonically decreasing function and hence always intersect. The generalised ham- 
sandwich cut problem requires one to find the point where the level of the first set and 
the level of the second set meet, given p and q. 

Megiddo gave a linear time algorithm for computing a (generalised) ham-sandwich cut 
in the linearly separated case [Meg85]. Subsequently Lo and Steiger improved it to the 
general case [LS90]. 

1.3 Our thesis 

In chapter 2, we explore the applications of computing a centerpoint, particularly in finding 
graph separators. Shang-llua Teng [Tng91] has described an algorithm for finding a graph 
separator of a geometric graph, known as overlap graph, which uses the notion of a cen- 

I. 

terpoint. We discuss his algorithm and then outline the applications of computing graph 
separators in Numerical Analysis - for solving sparse linear systems of equations. 

In chapter 3, we present an optimal parallel algorithm for finding a centerpoint of a finite 
point set in two dimensions, in the exclusive read, exclusive write parallel random access 
machine model. The algorithm takes 0(log®nlog*n) time using processors. It 

uses, as a subroutine, an algorithm for finding a generalised ham-sajidwich cut(separable 
case). We present an optimal parallel algorithm for this as well, taking O(log^ i\riog*JV) 
time using jpgS mos* l v Processors where N is sum of the cardinalities of the two sets involved. 

In chapter 4, we generalise the notion of a centerpoint of a finite planar set of points. 
Each point in the set is associated with a positive real number called its weight and the 
notion of a weighted centerpoint is defined in an analogous manner to that of an ordinary 
centerpoint. We show that a weighted centerpoint always exists for any configuration of 
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points with arbitrary weight assigmncuts and give a linear time algorithm for computing a 
weighted centerpoint when the input points are the vertices of a convex polygon. Finally, 
we generalise Cole et ofs algorithm to work within the same time bound for any weighted 
configuration of points. 



Chapter 2 


Applications of Computing a 
Centerpoint 

2.1 Introduction 

Computing small graph separators has important applications in Numerical Analysis and 
Computational Geometry. The most classical applications of the separator results are 
Nested Dissection [AG73] and Generalised Nested Dissection [LRT79]- techniques used 
for solving large sparse linear systems of equations. Shang-Hua Teng, in his Ph.D the- 
sis [Tng91], proposed a new class of geometric graphs, known as overlap graphs, which 
includes, inter alia, planar graphs and meshes as special cases. This class of graphs is 
defined based on elementary geometric objects such as points, balls etc. He described an 
algorithm for computing a small separator of an overlap graph, which uses the notion of a 
centerpoint. This algorithm will be discussed in section 2.3. In section 2.4, we outline the 
applications of computing graph separators in Numerical Analysis- for solving sparse linear 
systems of equations. 


2.2 Graph separators 

A set of vertices whose removal disconnects a given graph is known as a vertex separator. 
Edge separators are also defined similarly. 

Definition 2.1 A subset of vertices C of a graph G with n vertices is an f{n) -separator 
that 6 -splits if \C\ < f(n) and the vertices of G — C can be partitioned into two sets A 
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and B such that |j4|,|5| < 6n and there is no edge between A and B, where f is a positive 
function and 0 < ^ < 1. 

When we say that a graph G has a small separator, we mean that there is a constant 6, 
0 < 6 < 1, and a sublinear function / such that G has an /(n) separator that ^-splits. We 
say that a class of graphs has an /(n)-separator theorem if there exist constants ^ < 1 and 
> 0 such that every graph in the class has a ;9/(n)-separator that 6-splits. 

Divide and Conquer paradigm, when applied to graph problems, requires partitioning 
the given graph by removing a subset of vertices or edges. The subproblems are those 
associated with the connected components of the resulting graph. The cost of combining 
usually depends on the size of the separator used. 


2.3 Application of centerpoints in finding graph separators 

In this section, we introduce the notion of overlap graphs and show that the class of overlap 
graphs includes grids, planar graphs and ^-nearest neighborhood graphs as special cases. 
We then discuss an algorithm for finding a small separator of an overlap graph. The results 
in this section are from [Tng91]. 

2.3.1 Some simple classes of graphs 

One of the simplest classes of geometric graphs is the class of grid graphs. In two dimensions, 
they are also known as meshes. Another example is the class of Ar-nearest neighborhood 
graphs- which has important applications in Computational Geometry. 

Let V = {pi, . . . ,Pn} be a set of n points in Let Nk{pi) denote the set of k nearest 
neighbors of pi; ties are broken arbitrarily. 

Definition 2.2 A k-nearest neighborhood graph ofT= SR:** is a graph with 

vertices V = {1,. . . ,n} and edges E where E = {{i,j) t Pi € Nk{pj) or pj € Nk{pi)}- 

Yet another class of graphs in which we would be interested in this section is the class 
of planar graphs. 
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2.3.2 Neighborhood systems and overlap graphs 

We now give a sequence of definitions and results - either omitting the proof fully or only 
giving a brief sketch of it. The details can be found in [Tng9i}>. 

Definition 2.3 An Euclidean neighborhood of a given point p £ ^ is a closed ball of 
certain radius centered at p. A neighborhood system E = {Bi,. . . ,Bn} is a finite collection 
of neighborhoods. 

Let S = {5i, . . . , jB„} be a neighborhood system. Let pi be the center of Bifl<i< n. 
"P — {Pi > • • • > Pn} is called the centers of S. 

Definition 2.4 For any integer k, H is a k-neighborhood system if, for all i, 1 < i < n, 
the interior of Bi contains no more than k points from V. 

Given a ball B of radius r and a positive real number a, a.B denotes the ball with the 
same center but radius ar. If a > 1, then the ball a.B is known as the a-dilation of B. 

Definition 2.5 The kissing number in d-dimensions, denoted by rj, is defined to be the 
maximum number of non-overlapping unit balls in 3?*^ that can be arranged so that they all 
touch a central unit ball. 

The kissing numbers in one, two and three dimensions are 2, 6 and 12 respectively. 
Given a neighborhood system 5 and a point p € densitys.{p) denotes the number of 
neighborhoods in E that contain p (either in the interior or on the boundary). 

Lemma 2.1 (Density Lemma) For each k-neighborhood system E = {Bi,...,Bn} in d 
dimensions, for each p € densitys(p) < rfit. 

Proof: See [Tng91]. □ 

Given a neighborhood system, two graphs can be defined - an intersection graph and 
an o-overlap graph (a > 1). Both have the same set of vertices i.e. one vertex per ball. 
The intersection graph has an edge between two vertices whenever the corresponding balls 
intersect. The a-overlap graph, a > 1, has an edge between the vertices corresponding 
to two balls if the a-dilation of the smaller ball intersects the larger ball. Note that the 
intersection graph is the same as l-overlap graph. 
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We now prove that the class of overlap graphs includes grids, planar graphs and k- 
nearest neighborhood graphs as special cases. To precisely define what the above statement 
means, we introduce the notion of an embedding. 

An embedding of a graph G = (V, E) in is a mapping 3?“^. An edge (u, v) of 

G is mapped to the line segment (7r(«),7r(t;)). 

Definition 2.8 A graph G = iV,E) is k-embeddable in 3?“* if there is an embedding v of G 
such that for all {u,v) £ E, BuC) Bv ^ 4>> tvhere Bu is the largest ball centered at 7r(u) that 
contains no more than k points from {7r(tt)) : w € V} tn its interior. Similarly, G = {V,E) 
is {a,k)-embeddable in 3?*^ if there is an embedding v of G such that for all («,v) € E, 
Bu n {a.Bu) 7 ^ 4> and (a.jBu) 0 B^ ^ 4>. 

The above definition implies that 

1. A graph G is fc-embeddable if G is a subgraph of the intersection graph of some 
fc-neighborhood system. 

2. A graph G is (a, A:)-embeddable if it is a subgraph of the a-overlap graph of some 
fc-neighborhood system. 

Theorem 2.1 The class of overlap graphs includes grid graphs in d-dimensions. 

Proof: Given a grid graph, we define a 1-embedding in rf-space as follows : At each vertex 
of the grid, we put a ball with radius The given grid graph is the intersection graph of 
this neighborhood system. □ 

Theorem 2.2 The class of overlap graphs includes planar graphs. 

Proof: See [Tng91]. □ 

Theorem 2.3 The class of overlap graphs includes k-nearest neighborhood graphs in d- 
dimensions. 

Proof: Let P be the given set of n points in d-space. A fe-embedding is defined by 

letting the points to map to themselves; Bi denotes the largest ball centered at pi whose 
interior contains no more than k points from P. According to the definition of fc-nearest 
neighborhood graphs, there is an edge between points p, and pj only if either p,- is in Bj or 
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Pj is in Bi. Whenever this is the case, Bi and Bj intersect and hence the intersection graph 
has an edge between p.- and pj. Thus it follows that each fc-nearest neighborhood graph in 
d-dimcnsions is A:-ombcddablc in d-space. O 

Thus to find a separator for a given graph (from the above classes), we embed it in 
d-space and find a neighborhood system, the overlap graph of which is a "super graph" of 
the given graph. We then proceed to compute a separator for this overlap graph, which will 
be a separator for the original graph as well. 

2.3.3 Sphere separators 

Each sphere S in separates int{S) from ext(5) in the sense that any segment connecting 
a point in int{S) with one in ext{S) must intersect S and is known as a sphere separator. 
If P is a set of n points in ^ and S a constant, 0 < ^ < then S ^-splits P if both 
|in<(S') n P| < and |ex<(5) fl P| < 6n. 

Cost of a sphere separator 

Just as a vertex separator has a cost (usually the number- of vertices in the separator), a 
sphere separator is also associated with a cost - the surface area of the sphere being the 
most natural choice. The surface area may either be weighted or unweighted. 

The unweighted surface area of a sphere in d-space with radius r.is given by 

.r(d/2) 


where F is the gamma function. 

In the weighted case, there is a real valued non-negative function f(x) such that /* is 
integrable for all I: = 1, 2, 3, . . .. Such an / is called a density function. It may be defined 
on 3?^ or on a unit d-sphere in We consider the case where / is defined on a unit 

d-sphere, say Ud, in The total volume of / is defined to be 

total -volume{f) = L ^y{v)f{dvY 

A great sphere of Ud is the intersection of Ud with a hyperplane passing through the 
center of Ud- The weighted area of a great sphere GS of Ud is given by 
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Areaf{GS)^ / Uiv)Y-\dvY-'^ 

Jv&GS 

Let avg{f) be the average area over all great spheres of Ud- We have the following 
proposition : 

Proposition 2.1 Suppose f is a density function on a unit d-sphere Ud, then avg(f) = 
Ad~i (^{total.volume{f))~^ where Ad stands for the surface area of Ud- 

Proof: See [Tng91]. □ 

Intersection number 

Let S be a neighborhood system in d-space. Each (d — l)-dimensional sphere S partitions 
the balls in E into three sets : those which are completely in the interior of S (E/), those 
which are completely in the exterior of 5 (Eb) and those which intersect 5 (Eo). The 
cardinality of Eo is known as the intersection number of S and is denoted by is(S). 

Removal of the balls in Eo splits E into two subsets E/, Ee such that no ball in E/ 
intersects any ball in Eg and vice versa. It follows that the removal of the vertices corre- 
spending to the balls in Eo from the intersection graph of E separates the graph. So we 
have the lemma : 

Lemma 2.2 If S is a sphere that 6-splits the centers of a neighborhood system E, then the 
vertices corresponding to Eo defined above is an iz{S)-8eparator that 6-splits the intersection 
graph ofE. 

So to find a small separator for the intersection graph of a neighborhood system E that 
^-splits, one finds a sphere separator S which ^-splits the centers of E and having a low 
intersection number. The vertices corresponding to the balls intersected by S give the vertex 
separator and the intersection number is the cardinality of the vertex separator. However 
these vertices need not separate the a-overlap graph of E. To separate the a-overlap graph, 
we remove, in addition to the above vertices, the vertices corresponding to the balls which 
satisfy the following two conditions : 

1. Their radii should be less than or equal to that of the radius of the sphere separator 


S. 
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2. Their a-dilations should intersect 5. 

The total number of such vertices is called the overlap number of S and is denoted by 
Note that in both the cases, the vertex separator can be found from the neighborhood 
system and the sphere separator in linear time. 

Existence of a sphere separator with low cost 

The stereographic projection is a mapping which maps ^ plus infinity onto the unit d- 
sphere in with origin as the center, denoted by Ud. The pre-image of a great sphere 
GS of Ud is a (d - l)-sphere, say 5, in 3?“^. Moreover the interior and the exterior of S 
are mapped to the two hemispheres of Ud, defined by GS. S ^-splits a point set P iff GS 
^-splits Q, the image of P under stereographic projection. Now if the origin is chosen to 
be a centerpoint of Q, then the pre-image of any great sphere will g^-split P. Based on 
these ideas and using proposition 2.1, Miller and Thurston derived the following continuous 
separator theorem when the weighted surface area is used as the cost function of a sphere. 

Theorem 2.4 (Miller and Thurston) Suppose f is a density function on ^ and P a 
set of n distinct points in 3?*^. Then there is a sphere S which j^-splits P such that 
Areaj(S) = 0 (^{totaljoolume{f))^^. 

Let E be a d-dimensional neighborhood system with density p. In the above theorem, 
P can be chosen to be the centers of E. By an appropriate choice of a density function, the 
following theorems can be derived: 

Theorem 2.5 (Intersection Graphs) If G is the intersection graph of a d-dimensional 
neighborhood system with density p, then G has an 0(p^n~^)-separator that j^-splits. 

Theorem 2.6 (Overlap Graphs) If G is the a-overlap graph of a d-dimensional neigh- 
borhood system with density p, then G has an 0{a.p'in~S~^-separator that j^-splits. 

Proof: In both the cases, the density function is chosen such that 

1. total joolume{f) is linearly bounded by n i.e. in the theorem on intersection graphs, 
totaLvolume(f) = 0{p'^n) and in the case of overlap graphs, total joolume(f) = 

d 1 

0{cx<*-i //«'-> n). 

2. The weighted surface area is an upper bound on the intersection number in the case 
of intersection graphs and on the overlap number in the second case. □ 
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2.3.4 A randomised algorithm for finding a sphere separator of low cost 

The discussions in the previous subsections suggests the following randomised algorithm for 
computing a sphere separator with a small area, given a set of points P in d-space : 
ALGORITHM 

1. Project the points in P onto Ud ( the unit d-sphere in centered at the ori^n o) 
using a mapping tt, such that the origin o is a centerpoint of the projected points. 
Such a TT is demonstrated in [MT90]. 

2. Randomly choose a great sphere GS of Ud- 

3. Using the inverse of tt, map GS back into d-space to get a (d — l)-sphere 5. 

Because o is a centerpoint of the projected points in (d + l)-space, the sphere S 
splits P. Moreover, as proved in the continuous separator theorem, with probability |, 5 

d—1 , 

has area {total. volume{ f ))~^ , where / is any density function defined on Si*. 

The running time of the above algorithm depends on the time needed to compute a 
centerpoint in (d4- l)-space; other steps take 0(n) time. However no efficient algorithms are 
known to compute a centerpoint in d-dimensions, d > 2. Using an approximate centerpoint 
algorithm, [Tng91] has shown how to compute a sphere separator with small area in random 
linear time. 


2.4 Graph separators in Numerical Analysis 

In this section, we discuss the applications of finding graph separators in Numerical Analysis. 
The classical applications are Nested Dissection [AG73] and Generalised Nested Dissection 
[LRT79] - techniques used for solving sparse linear systems of equations. 

Suppose we wish to solve the system of linear equations Ax = h by Gaussian elimination. 
A is an n X n symmetric, positive definite matrix, x is an n x 1 vector of variables and b is 
an n X 1 vector of constants. Usually the solution is obtained in two steps : first we factor A 
by means of row operations into A = LDL^ where L is lower triangular and D is diagonal. 
Next we solve the simplified systems Lz = b, Dy = z and L^x = y. 

If A is dense, then the time required for factoring A is 0(n®) and the time required for 
solving the simplified systems is O(n^). But if A is sparse, then we may be able to save time 
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and space by avoiding explicit manipulation of zeros. However, the factoring process may 
create nonzeros in L in positions where A had zeros. These nonzeros are called fill — in. 

The amount of fill — in can be reduced by suitably reordering the variables i.e. we 
transform A into A' = PAP^ where P is a permutation matrix and then solve the reordered 
system. Let A' = LDL^ . Let d{i) denote the number of nonzeros in column i of L. Then 
the number of nonzeros in L is m = d(i). The number of multiplications performed 

during the factorisation can be shown to be 5 d(i)(d(i) + 3) [Ros73]. So a major issue 

in sparse Gaussian elimination is that of finding a good ordering of the variables which 
reduces the number of nonzeros in X and the multiplication count. 

We can represent the given matrix A by means of an undirected graph G = (V,£). 
The graph G contains one vertex i € V for each row of A and one edge {t,i} € E for 
each pair of nonzero, off-diagonal elements a,j = aj,- ^ 0 in A. Each permutation matrix 
corresponds to a numbering of the vertices of G and the graph G represents the class PAP^ . 
See [RTL76] and [Ros73] for a discussion of the properties of this graph-theoretic model of 
sparse Gaussian elimination. 

Finding a good ordering of the variables i.e. a numbering of the vertices for an arbitrary 
graph seems to be very hard. However, for some special cases, good ordering schemes are 
known. 

1 . The Nested Dissection method of A.George [AG73] allows a linear system of equations 
whose graph is an n = fc x fc square grid to be solved in 0 (n 2 ) time and O(nlogre) 
space. His scheme uses the fact that removal of 0(fc) vertices from a fc x Ar grid dissects 
(i.e. separates) the grid and leaves four square grids, each roughly of size f X 

2. The above scheme has been generalised, with the same time and space bounds, to 
solve any system of equations, whose graph belongs to a class 5, where 5 is closed 
under the subgraph relation and satisfies a v^-separator theorem [LRT79]. Since the 
class of planar graphs satisfies the above two conditions [LR79], it follows that any 
system of equations whose graph is planar can be solved in 0 (ns ) time and 0 (n logn) 
space. The method employed to generate the ordering in the above scheme uses an 
algorithm for finding a v'n-separator for planar graphs. 



Chapter 3 


An Optimal Parallel Algorithm 
for Computing a Centerpoint in 
Two Dimensions 


3.1 Introduction 

The model of computation used for the parallel algorithms presented in this chapter is 
the exclusive-read, exclusive write (EIIEW) parallel random access machine, which does 
not allow simultaneous access by more than one processor to the same memory location , 
for either read or write purposes. Given a parallel algorithm, the product of the number 
of processors used and the time taken by the algorithm gives a measure of the number of 
operations performed. The parallel algorithm is said to be optimal if this is of the same order 
as the fastest known worst-case running time of a sequential algorithm for the problem. 

In this chapter, we present an optimal parallel algorithm for finding a centerpoint of a 
finite point set in two dimensions. In addition, we give an optimal parallel algorithm for 
finding a generalised ham-sandwich cut in the plane(separable case), which is used as a 
subroutine in the centerpoint algorithm. Both are in the EREW model. 

3.2 Preliminaries 

Consider a parallel algorithm with k stages. Suppose stage i can be implemented using p,- 
processors in time f,-. Let ?,• = Pi* ti~ 
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Let Q = Yii=i 9i = Y^i=i Now suppose fQ/2’1 processors are available. Then 

stage i can be implemented to run in ^ time. So the 

total time taken by the algorithm will be 2t,' + = 2T + = 0(^)* 

Thus the entire algorithm can be executed in 0(7) time using 0(Q) operations. 

We will be using the above idea, known as Brent’s lemma([Br74]), in the algorithms de- 
scribed below. We describe the algorithms as if different number of processors are available 
for the different stages and determine the total number of operations performed Q and the 
total time taken T. It will then be possible to implement the algorithms using \{QjT)\ 
processors to run in 0(7) time. The rescheduling of the algorithms is straight forward. See 
appendix B. 


3.3 An optimal parallel algorithm for finding a generalised 
ham-sandwich cut in two dimensions 

In this section, we present an optimal parallel algorithm for finding a generalised ham- 
sandwich cut(separable case) in the EREW model, which is a parallel version of Megiddo’s 
algorithm [Meg85]. The problem is considered in the dual plane. 

We are given two sets of lines : A={y = a,x -f 6,- : t=l to n} where each o,- > 0 and 
B={y = CiX + di : t=l to m} where each Ci < 0. Let N = n + m. Also given are two 
integers p and q, 1 < p < n and 1 < g < m. Let I{x*,y*) be the point where the level 
of A and the level of B meet. We are interested in finding the point I{x*^y*). 

The search for the point I is carried out as follows : we use a subroutine that determines, 
for any arbitrary line , the side on which / lies. In each iteration, a fixed fraction of the 
lines is discarded from the two sets, after identifying the side on which I lies for each of 
them. We do this by testing two ’special lines’ in the above sense. 

Given any line L:y = ax 9, we show how to determine the side on which I lies in 
0(log JVlog’^iV) time using ^ processors. There are four cases : a > 0, a < 0, 

a = 0 and a = oo (i.e. L is vertical). We consider the case when a > 0; other cases are 
similar. 


1. Find the intersections of the m lines in B with L, This can be done in 0(log N log* N) 
time using processors. 
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2. From among the i-coordinates of these intersection points, choose the largest one 

x'. This can be done in O(logi\riog* iV) time using processors using the 

selection algorithm of Cole [C0I88]. 

3. Let y'=ax' + 9. Find the intersections of the lines in A with the line x=x’ and from 

among the y-coordinates of these intersection points, choose the largest one yp. 
This can be done in 0(logiV log* N) time using processors. 

If yp=y' , we have found I. 

If yp > y', I lies above L. 

If Vp < y', I lies below L. 

So the question of determining on which side of a given line I lies can be answered in 
0(logiVlog*iV) time using processors where N is the sum of the cardinalities of 

tlift two sots. 

The parallel algoritlim for finding the cut has O(logJV) stages. In each stage, a fixed 
fraction 1 — /? (y3 > 0) of the lines are discarded after determining the side on which / lies 
for each of them. For the method described below, (3 turns out to be |. 

The pruning is carried out as follows : 

1. Find the median slope s of the lines (from both the sets) and then form two sets : 
one having lines with slope larger than s and the other having lines with slope smaller 
than s. Pair lines- one from each set and compute their point of intersection. 

2. From among the i-coordinates of the intersection points, find the median x-coordinate 

3. Test on which side of the line x = Xm, I(x*, y*) lies. Say > x*. Let 5 be the set 
of intersection points each of whose x-coordinate is greater or equal to Xm- 

4. Let S' be the set of lines of slope s through each point in S plus the lines of slope s 
from both the sets. Find a line of slope s which divides this S' into two equal parts 
and test on which side of this line I lies. Thus we identify an open quadrant Q in 
which I lies. 

5. Each pair associated with the closed quadrant, opposite to Q has a line that does not 
pass through Q i.e. we know the side on which I lies for this line. In addition, there 
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may be some lines of slope s that avoid Q. Altogether there are at least y lines which 
avoid Q and they can be removed from further consideration. The values of p and q 
are properly updated. 

Each step here can be done in 0(logXlog*X) time using processors where 

X = JV for the first stage, X = PN for the second stage, X = for the third stage and 
so on. See appendix B for a detailed analysis of the algorithm. The total work done = 
iV -)-;9Ar + 4- . . . =0(Af). The total time taken = logAf log* Af +log(/?Af)log*(/3Ar) + . . . 

= O(log^ N log* AT). So the whole algorithm can be implemented to run in 0(log^ AT log* Af ) 
time using processors. 

3.4 An optimal parallel algorithm for computing a center- 
point in two dimensions 

In this section, we present an optimal parallel algorithm for computing a centerpoint in the 
EllEW model which is a parallel version of the algorithm presented in [JM93]. 

Let P be the given set of n points. We briefly describe the algorithm given in [JM93] for 
finding a centerpoint. The notation Sh denotes the set of points contained in the half-plane 
H and San denotes the set of points common to the half- planes G and H. The algorithm 
finds four closed half-planes L, U, D and R, each containing less than fj] points of P and 
situated so that the intersection of their complements is a bounded open quadrilateral or a 
triangle. It is also ensured, while finding these half-planes, that each of the sets Swi ShDt 
Spu a^nd Srd contain at least - 1 points of P. Four points are then chosen, one from 
each of the above sets and if they form a convex quadrilateral, they are replaced by the 
point of the intersection of the diagonals of the quadrilateral and if they form a non-convex 
quadrilateral, the interior point is retained and the remaining three arc discarded. As any 
centerpoint of the new set is also a centerpoint of the original one, the algorithm proceeds to 
work on the new set. Thus in each stage, a fixed fraction of the input points are discarded 
and the work done per stage is linear, hence the whole algorithm runs in linear time. 

The four planes, mentioned above, are found as follows : from among the x-coordinates 
of all the points, the (ffl — one is found and the closed half- plane to the left of the 
perpendicular line through the point with this x-coordinate-is chosen as L. Then the ham- 
sandwich cut algorithm is used to partition 5 l in the ratio 1:3 and the set P — Si'm the ratio 
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3:5 so that one of the closed half-planes determined by the partitioning line contains exactly 
fj] — 1 points of P. This closed half- plane is chosen as U. It is ensured, while finding the 
cut, that Slu contains — 1 points of P. The closed half-plane D is determined similarly. 
To find R, the sets Su and P — Su are partitioned in a similar manner. By construction, 
the sets Sw, Sld and Sbu contain [^J - 1 points each and as shown in [JM93], the set 
Srd contains at least - 1 points. 

The parallel algorithm has O(logn) stages. In each stage, a fraction 1 — a of the points 
will be pruned as described above. 

A typical stage is implemented as follows: 

1. The four planes L, U, D and R are found using the selection algorithm of Cole [Col88] 
and the ham-sandwich cut algorithm described in section 3.3. 

2. The four sets Slu, Sld, Spu and Srd are found and then the points are pruned as 
described before. See appendix C for more details. 

Each stage can be implemented to run in O(log^ X log* X) time, doing 0(A) work 
where X = n for the first stage, X = an for the second stage and so on. So the total work 
done is 0(n) and the total time taken is 0(log^ n log* n). So the whole algorithm can be 
implemented to run in 0(log^ nlog*n) time using processors. 



Chapter 4 


The Notion of a Weighted 
Centerpoint 


4.1 Introduction 

There is another way of looking at the notion of a centerpoint of a planar set of points- as 
a generalisation of the concept of a median of a set of points on the real line. The concept 
of a median of a set of points on the real line has been extended to the ’weighted median’ 
notion, where each point is associated with a positive weight and a weighted median is 
delined as a point which gives a balanced partition of the total weight of all the points. A 
linear time algorithm for finding a weighted median is given in [llei78]. 

In a similar spirit, we define the notion of a weighted centerpoint of a finite planar set of 
points. We show that a weighted centerpoint always exists for any configuration of points 
with arbitrary weight assignments. We also propose a linear time algorithm for finding a 
weighted centerpoint of a configuration of points that form the vertices of a convex polygon 
(The input should be in the form of a list of vertices as they lie on the boundary of the 
convex polygon). Finally we give an 0(n log^ n) algorithm for finding a weighted centerpoint 
of any weighted configuration of points, generalising Cole et afs algorithm [CSY87, Col87]. 

This chapter is organised as follows: In section 4.2, we give a formal definition of the 
notion of a weighted centerpoint and prove that such a point exists for any finite planar 
configuration of points. In section 4.3, we present a linear time algorithm for finding a 
weighted centerpoint of a configuration of points that form the vertices of a convex poly- 
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gon. Section 4.4 discusses an algorithm for finding a weighted centerpoint of a general 
configuration of points. 

4.2 Definition and proof of the existence of a weighted cen- 
terpoint 

We are given a set V of n points in the plane. Each point p, is associated with a positive 
number tuj, denoting its weight. Let the total weight of all the points be W. 

Definition 4.1 A point x in the plane, not necessarily in V, is a weighted centerpoint if 
for any closed half-plane containing x the sum of the weights of the points included in it is 
at least 

Definition 4.2 The weighted center is the collection of all weighted centerpoints. 

We now show that a weighted centerpoint always exists for any configuration of points 
with arbitrary weight assignments. The proof is similar to the one given in [YB61] for the 
case of an ordinary centerpoint. 

Let TZ be the intersection of all dosed half-planes, the sum of the weights of the points 
included in each of which is strictly greater than W — f^K= [^j). 

We claim that any point belonging to is a weighted centerpoint. Otherwise, if x is a 
point in TZ which is not a weighted centerpoint then there exists a closed half-plane whose 
bounding line I contains x and the sum of the weights of the points included in it is less 
than f . Consider a line m, lying in the complementary open half-plane, parallel and 
sufficiently close to I so that none of the points of V lie between I and m. The sum of the 
weights of the points induded in the closed half-plane defined by m, which exdudes the 
line /, is greater than W — and hence x, being a point in TZ, should be in this dosed 
half-plane - a contradiction. 

It remains to show that TZ is always nonempty. We assume that the whole point set is 
covered by a disk and consider the infinite number of bounded convex regions obtained by 
intersecting each of the half-planes above with this disk. 

We show that any three of these have a common point and hence by Helly’s theorem it 
would follow that TZ is nonempty. 
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Figure 4.1: Distribution of the total weight of the points in the four quadrants 

Let two of the half-planes be situated as in figure 4.1 . Quadrant 1 is closed, whereas 
quadrant 3 is open; X and S are respectively the sum of the weights of the points included in 
these quadrants. Quadrant 4 is sort of half-closed in the sense that it includes all the points 
on the open half-line vc. Similarly, quadrant 2 includes all the points on the open half-line 
vb. Y and Z are respectively the sum of the weights of the points in these quadrants. 

Since X + Y > X + Z > f^], it follows that X is at least 

W - (2[^]) -1- 2. This means that if all the points in quadrant 1 are excluded, then the 
sum of the weights of the remaining points is less than or equal to 2 — 2, which is never 

greater than W — . So the third closed half-plane will have to include at least one point 

from quadrant 1 which means that any three closed half-planes have a common point. 

So we have the theorem : 

Theorem 4.1 The weighted center is always nonempty for any planar configuration of 
points. 


4.3 An algorithm for finding a weighted centerpoint of a 
convex polygon 


In this section, we present an algorithm for computing in linear time a weighted centerpoint 

of a set of points that the form the vertices of a convex polygon. The input is a list of 

vertices in the order in which they occur on the boundary of the convex polygon. It turns 

out that the weighted center itself can be computed in O(nlogn) time. -• ^ ^ A 

^ V & KANPUR 


I *4 ► ^ 
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Figure 4.2: Illustrating the definition of CHAIN 

Let P be a convex polygon with n vertices. The vertices are labeled from vi to Vn in 
the counterclockwise order (Vertex Vn+i is the same as vertex vi). Vertex V{ has weight Wi. 
Let W= ^ Wi. 

If any vertex has weight greater than , then that vertex is a weighted centerpoint 
(in fact, it is the only weighted centerpoint). This can be checked in linear time. So we 
assume that every Wi < . 

Let CHA^N(vi)=ViVi.^.l ...Vj,i < j represent a counterclockwise chain of vertices, begin- 
ning from Vi such that the sum of the weights of the vertices included in it just > 

Let CP{vi) denote the closed half-plane defined by the line through the vertices v,-_i and 
Vj which does not include the vertices ,Uj _2 and See figure 4.2. Such a 

half-plane CP{vi) is well defined for each i. 

Let TZ denote the intersection of all the CP(u,)’s. 

Lemma 4.1 Any point not belonging to TZ is not a weighted centerpoint. 

Proof : Let x be any such point. Since x does not belong to TZ, there is some CP(vi) which 
does not contain x. Now consider the closed half-plane, defined by a line through x parallel 
to the bounding line of CP(vi), which does not include CP(vi). The sum of the weights 
of the points included in this closed half-plane is less than f^j. So x is not a weighted 
centerpoint. □ 
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Figure 4.3: Diagram for the proof of Lemma 4-3 
Lemma 4.2 Any point belonging to TZ is a weighted centerpoint. 

Proof : Let x be any such point. Consider an arbitrary line through x. Since the given 
polygon is convex this line intersects the polygon boundary at exactly two points, A and B. 
The point x belongs to the closed segment AB (It is obvious that x lies inside the polygon). 
See figure 4.3. 

Let the sum of the weights of the vertices included in one of the closed half-planes, 
defined by the line through i, say the one indicated in figure 4.3, be less than f^]. In 
the chain from B to A, let M be the first vertex (possibly B) and let N be the last vertex 
(possibly A). The sum of the weights of the vertices included in the chain from M to 
N is less than So CHAIN(M) includes at least the vertex next to N. So CP{M) 

completely excludes the closed segment AB and hence the point x. This contradicts the 
fact that X belongs to TZ. □ 

Theorem 4.2 TZ is the weighted center and hence is nonempty. 

Proof : Follows from lemmas 4.1 and 4.2 and the fact that' a weighted centerpoint always 
exists for any configuration of points with arbitrary weight assignments. □ 

Though TZ is the intersection of n closed half-planes CP{vi), it may not be necessary to 
compute all of them. This is because if CHAJN{vi)=ViVi.^x. . .Vj and Wj is so large that 
CjEL4/iV(v,-+i)=i;i+ii;i+2. . . Uj, then ^^(ui+i) will not contribute anything to the intersection 
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TZ and hence need not be generated. We outline a procedure Generate_Closed_Halfplanes() 
below which generates all the necessary ones in linear time. 

Then a point belonging to their intersection which is a weighted centerpoint can be 
computed in 0(n) time using Megiddo’s Linear Programming technique [Meg83b]. Further, 
the whole intersection, and thus the weighted center, can be computed in O(nlogn) time 
[PS85]. 


PROCEDURE Generate_Closed_Half planes () 

Comments: start and end are two variables. The subroutine Increment(x) will 
msLke X point to the next vertex in the coimterclockwise order. 

Begin 

1) Initialise start and end as pointing to ui . 

2) While true do 

a) While the sum of the weights of the vertices included between start 

and end, both inclusive, is less than Increment{end) . 

b) ^Output* CP{start). 

c) Increment{start) until the sum of the weights of the vertices included 

between start and end, both inclusive, becomes just less than If 

start becomes vi again at any time during the above step, then break. 

End. 

The above procedure takes 0(n) time : 

• Step 2a executed at most n-14-n-2 = 2n-3 times. (When start is at t>„, end may be 
at most at Vn -2 having completed one full round) 

• Step 2b executed at most n times. 

• Step 2c ejtecuted exactly n times. 

4.4 An algorithm for the general case 

In this section, we present an O(nlog®n) algorithm for computing a weighted centerpoint 
of any weighted configuration of points. The algorithm presented is a generalisation of the 
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Figure 4.4: The six regions defined by a pair of points 

one presented in [CSY87] for computing an ordinary centerpoint. Using Cole’s technique 
[Col87], it can be improved to run in O(nlog® n) time. 

We are given a set P of n points. Each point p,- has a weight Wi associated with it. Let 
W be the total weight of all the points. The rotational order of the points in P about a 
given point p is defined as follows: Consider a horizontal line L through p. A new point set 
is obtained from P by retaining the points which are on or above L and taking the reflection 
through p of the points below L. The angular order of this new set about p defines the 
rotational order of the original set about p. 

We now sort the points in P using the rotational order about some point in the weighted 
center p* - which is not known at present but will be determined in the process. We use an 
m processor, h time parallel algorithm. The relative rotational order of pi and pj about the 
point p* depends on which of the six regions p* lies in (See figure 4.4). These regions are 
defined by the horizontal lines through p,- and pj and the line Lij passing through p,- and 
Pj- 

We use a subroutine, which given an arbitrary line i, tells whether the weighted center 
intersects L and if so, gives a point on L in the weighted center, and if not determines on 
which side of L, the weighted center lies. This subroutine takes O(nlog^ n) time. Using this 
subroutine, we perform a binary search on the n horizontal lines through the n points in 
P and determine a pair of consecutive lines between which the weighted center and hence 
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p* lies or find a weighted centerpoint on one of these n lines. This takes O(nlog‘*n) time. 
Now that we know the two consecutive lines between which the weighted center lies, all that 
we have to do to resolve the comparison between p,- and pj is to determine on which side 
of Lij p* lies. We do this by using Megiddo’s technique for Linear Programming in fixed, 
dimensions [Meg83b]. 

In each stage of the sorting algorithm, there are m lines generated, for each of which 
we have to determine on which side p* lies. In O(nlog^ n) time, we either determine for at 
least one eighth of the lines on which side p* lies or we find a weighted centerpoint in which 
case we stop. In the former case, we will have found two lines bounding the region in which 
p* lies. By iterating O(logTn) times, we either determine for every line on which side p* 
lies or we find a weighted centerpoint. Also in the former case, we will have determined a 
region containing p* bounded by O(log m) lines. 

To eliminate one eighth of the lines, we proceed as follows: Among the slopes of the m 
lines, we find the median value. Let it be s. We then pair lines so that each pair contains 
a line of slope less than s and a line of slope greater than s. For each pair, we compute 
the intersection point of the lines in the pair, and from among the x-coordinates of the 
intersection points, we determine the median value Xmed- We^Jlhen test whether the line 
L : X = Xyncd intersects the weighted center and if not, find on which side of L the weighted 
center lies. If L intersects the weighted center, then we get a point in the weighted center 
on L and we stop. So without loss of generality, assume that the weighted center lies to 
the left of L. We then find the line U of slope s which evenly divides the collection of lines 
of slope s consisting of those lines of slope a in the original set plus the lines of slope a 
through the intersection points to the right of L. We determine whether L' intersects the 
weighted center and if not, find on which side of L' the weighted center lies. Thus we either 
find a weighted centerpoint or the quadrant in which the weighted center and hence p* 
lies. For every intersection point in the opposite quadrant, one of the two lines through the 
point will avoid the quadrant in which p* lies. These lines can be eliminated from further 
consideration and they are at least one eighth of the original set. 

Thus when the algorithm (using either Preparata’s algorithm [Pre78] or the AKS net- 
work [AKS83]) terminates, we will have found either a weighted centerpoint or a region TZ, 
with respect to which the rotational order of the points in P is fixed. The whole algorithm 
takes 0(nlog® n) time. The region 72. is the intersection of /i=0(jiogn) regions, one for each 
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step of the sorting algorithm. Each such region is bounded by O(logm)= O(logn) lines. 
Since the rotational order of the points in P is the same about any point in TZ, either every 
point in 77 is a weighted centerpoint or none is. Since the weighted center is nonempty for 
any configuration of points with arbitrary weight assignments, one of the points in 77 must 
be a weighted centerpoint and hence every point is. 

We now describe an algorithm, which tests whether the weighted center intersects a given 
line L, giving a point on L if yes, and if no, determining on which side of L the weighted 
center lies. We can assume L to be the x — axis. For each orientation 6, — tt < ^ < ?r, 
and a point x, let Ib{x) denote the directed line through x with orientation 9. Let fe{x) be 
the ratio of the sum of the weights of the points which lie in the closed half-plane to the 
right of lg{x) to the total weight W. Define g{x)=mingf 9 {x), gi{x)=minQ^g<„fg{x), and 
92{x)=Tnin-^^g<ofg{x). 

The following observations are easy : 

1. Let L be any line (assumed to be the x — axis). Then for each 0, the function fg{x) 
is monotone as x varies along L. It is constant for ^ = 0 or jt, decreasing from 1 to 0 
for 9 € (0,7r) (the ’upward’ orientations), and increasing from 0 to 1 for € (— ir,0) 
(the ’downward’ orientations). 

2. gi{x) is a decreasing function on L and g 2 {x) is an increasing function on L. Let J 
be the interval of L over which 5i(a:)=fl'2(®)- This interval will always be non-empty. 
At all X € I, g{x) has the same value and g attains its maximum there. 

3. For any x, g{x) attains its minimum at some set 0(x) of orientations. If 0(x) con- 
sists only of ’upward’ (respectively ’downward’) orientations then x lies to the right 
(resp. left) of I, and conversely. Otherwise x lies in I and 0(x) has both types of 
orientations. 

We seek a point p in X. We search for this by finding the rotational order of the points 
in P about p, using an m processor, h time parallel algorithm. We compare two points 
Pi and pj as follows: Let Lij be the line through them. Let it intersect L in & point, x 
say. The result of the comparison depends solely on which side of x the point p lies. We 
rotate a line about x and compute fg{x) as 9 varies. Then using the third observation, 
we determine whether X includes x or if I is to the left or to the right of x. This takes 
O(nlogn). If X includes x, we choose x itself as p. So either we will resolve the comparison 
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Figure 4.5: Determining the side of L on which the weighted center lies 

or find the required point p. To batch m of these comparisons, we sort the resulting m 
points on L and perform a binary search among them (actually, find the medians etc.). 
This yields an interval J (containing p) over which the comparisons performed so far have 
a fixed result. This takes time O(nlognlogm + rn). Hence the whole algorithm takes 
0(/i(nlognlog m + m)) time. Using either Preparata’s algorithm(m = nlogn,h = logn) 
[Pre78] or the AKS network(m = n,h = logn) [AKS83], this becomes a time of O(nlog^n). 

When the algorithm terminates, we either have a point p g I or an open interval 
J = (AB) containing p with respect to which the rotational order of the points in P is 
fixed. Neither A nor B can be in I because in such a case, the algorithm would have output 
that point. Since the rotational order of P about any point in J is same, g has the same 
value throughout Since S contains p, a point of I, it follows that J = % and any point 
in will serve as p. So in all cases the algorithm finds p. If p(p) is greater or equal to 
one third, then p is a weighted centerpoint. Otherwise, p is not a weighted centerpoint 
and L does not intersect the weighted center. Since p(p) is less than one third, there exist 
two orientations, an ’upward’ 0\ and a ’downward’ 02? such that fsiip) —fhiP) ^ 5* 
figure 4.5. 
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It now follows that the weighted center of P lies in the shaded area of the figure 4.5 
(the side of L that this shaded area lies in depends on ffi), because for any point not in this 
shaded region either the line with orientation 0i or the one with orientation 02 is guaranteed 
to have points whose total weight is less than to its right. Thus if L does not intersect 
the weighted center, wc can determine on which side of L, the weighted center lies. 



Chapter 5 


Conclusion 


This thesis is on the centerpoint problem in Computational Geometry. Efficient algorithms 
for computing a centerpoint in two or higher dimensions have important applications. The 
following issues were discussed in this thesis : 

1. The application of computing a centerpoint in d-dimensions for finding graph separa- 
tors was discussed in chapter 2. A new class of graphs, known as overlap graphs, which 
includes many other classes of graphs which find applications in Numerical Analysis 
and Computational Geometry as special cases, was introduced. Next a randomised 
algorithm was presented to compute a small graph separator for any overlap graph. 
The use of finding graph separators in Numerical Analysis was thereafter outlined - 
for solving sparse linear systems of equations. 

2. A cost optimal parallel algorithm in the exclusive read exclusive write parallel random 

access machine model, for finding a centerpoint in two dimensions was presented in 
chapter 3. The time complexity was O(log^ nlog* n) and the number of processors re- 
quired was iog3 Jiog* n • addition, a cost optimal parallel algorithm for finding a gen- 

eralised ham-sandwich cut in two dimensions (linearly separated case) was presented, 
which was used as a subroutine in the centerpoint algorithm. The technique used is 
quite general and can be used to parallelise other prune-and-search algorithms as well. 
For example, we can give an optimal parallel algorithm in the EREW model, taking 
O(log^ n log* n) time for the two dimensional linear programming problem. However, 
X. Deng [Den90] has given an optim^ O(log n) time parallel algorithm for the above 
problem in the CRCW model. He observed that we do not need the full power of the 
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median for discarding a constant proportion of the constraints [Meg83b, Dy84] and 
used Cole’s approximate median finding algorithm [C0I88]. Whether incorporating 
approximation ideas (we need to define an approximate ham-sandwich cut) in the 
centerpoint algorithm leads to similar improvements is not known. 

3. In chapter 4, the notion of a centerpoint was extended to that of a weighted center- 
point. It was shown that a weighted centerpoint always exists for any configuration 
of points with arbitrary weight assignments. A linear time algorithm for finding a 
weighted centerpoint of a configuration of points that form the vertices of a convex 
polygon was also proposed. Next, an 0(nlog® n) algorithm for computing a weighted 
centerpoint of any weighted configuration of points was given; whether there is a linear 
time algorithm for this case is not known. 
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Appendix A 


The Ham-sandwich Cut Problem 
in the Dual Plane 


In this appendix, we introduce one specific geometric transform that can be used to map a 
set of points P in the plane to a set of lines L in the plane or vice versa, such that certain 
statements about P are true ilf their corresponding dual statements about L are true. We 
then describe the ham-sandwich cut problem in the dual plane. 

A.l A geometric transform 

Let p = (7ri,7r2) be a point in E^. Transform V maps p to the line V(p) given by the 
equation y = 2 ‘K\x — xj and vice versa, that is, it maps a nonvertical line / to a point P(/) 
such that P(P(/)) = 1 . 

If P is a set of points, then P(P) = {P(p) | P € P}. P is extended to sets of lines 
similarly. Note that the i-coordinate of p determines the slope of the line P(p). 

The following observations about P can be easily verified : 

Let p be a point in and let I be a nonvertical line in P^. 

• Incidence preservation : point p belongs to line I iff point P(I) belongs to line P(p). 

• Order preservation : point p lies above(below) line I iff point P(/) lies above(below) 
line P(p). 


r 
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A.2 The notion of an arrangement of lines and levels in 
arrangements 

A finite set L of lines in defines a dissection of into connected components of dimen- 
sions two, one and zero. We call this dissection the arrangement A{L) of L. 

Notation: Given a nonvertical line /, denotes the open half-plane lying above I and l~ 
denotes the open half-plane lying below /. 

Given an arrangement A{L) of n nonvertical lines in the plane, we define the following 
integer functions for each point p in the plane : 

• • p€.l~ for ai{p) lines I of L. 

• 0£.(p) P lies on I for ol(p) lines I of L. 

• ^l(p) • pEl'^ for bi{p) lines I of L. 

The A;-level Lk of the above arrangement is defined as the set of points p such that 
®l(p) < k and ox(p)H- 0 £,(p) > k, for each 1 < A: < n. Intuitively, it is a continuous, piecewise 
linear function whose segments always coincide with one of the lines in the arrangement. 

A. 3 The ham-sandwich cut problem in the dual plane 

We are given two sets of points in the plane : P with m points and Q with n points. These 
two sets are separated by a line with no point lying on the separating line. We axe interested 
in finding a nonvertical line I such that the number of points of P (Q ) contained in l~ is 
less than p (</), but when the points lying on I are counted, it is greater or equal to p (q ). 

We dualise the problem. Without loss of generality, we can assume that the separating 
line is the y-axis in the primal plane with P lying to its left and Q to its right. Then in 
the dual plane, all lines of P(P) will have negative slopes and all lines of t>{Q) will have 
positive slopes. Any level of A{'D{P)) is a monotonically decreasing function and that of 
A{p{Q)) is a monotonically increasing function and hence always intersect. 

Let T denote the point where the p** level of A(T>{P)) and the level of 
meet. Since r lies on the p^^ level of A{V(P)), the number of lines of V(P) lying above r 
is less than p but the number of lines lying above or passing through r is greater than or 
equal to p. A similar observation holds regarding the disposition of the lines of T>iQ) with 
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respect to r. These observations imply that in the primal plane, 'D{r)~ contajns less than 
p points of P and q points of Q but with the points lying on V{t) included, it contains at 
least p points of P and q points of Q, that is, V(r) is the line we aje looking for. Observe 
that V{r) contains exactly one point of P and Q each, if P U Q is in general position. 



Appendix B 


Detailed Analyses of the Parallel 
Algorithms 


In this appendix, we give detailed analyses of the parallel algorithms presented in chapter 3. 
We analyse certain steps in the ham-sandwich cut algorithm in detail - in particular, the 
subroutine for determining on which side of a given line JD, the point I lies (hereafter, 
referred to as line query) and step 1 of the pruning phase. The notations and the symbols 
used are the same ones used in section 3.3 of chapter 3. 

The first pruning phase of the ham-sandwich cut algorithm, as described, employs 
log N P^'^^^ssors. Since the model of computation is EREW, these processors can 
not siiniiltancously read a memory location. Hence the values of certain variables such as 
j/V, p etc which are needed by all the processors should be communicated to them and the 
time required for this should be taken into account. 

The line query routine is implemented as follows : 

1. The values of JV, m, n, p, g, a and 0 are communicated to all the processors. This 
takes 0{\o^{number^ofjproces3ors)) < 0{logN) time. 

2. To compute the intersections of the m lines in B with i, these m lines are divided 

into groups of size log N log* N each and each group is assigned to one 

processor (15^771^^7 processors remain idle) which computes the intersections of the 
lines in that group with X. This takes logiV log"^ N time. 

3. For selecting x\ we use Cole’s selection algorithm [C0I88] which can be implemented 
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in O(logm!og* 7n) time using P og^^g* processors. If the number of available pro- 
cessors is less than this, that is, \og}/\og*N logmlog*m ’ selection algorithm 

is rescheduled using j-g - yv^-g* processors which takes 0( tog^Tog*m ^ X 

log m log* m) time i.e. O(log iV log* j/V) time. This x' needs to be communicated to 
all the processors which takes less than log N time. 

4. The remaining steps can similarly be implemented in 0(logiVlog* JV) time. 

It is clear that the line query routine can be implemented in 0(log7Vlog* JV) time using 
processors. 

We now describe how to implement step 1 of the pruning phase in detail : 

1. The sets A and B are merged into a temporary array T of size JV. Set .4 is divided into 
K gT V T og ^' JV groups of size log JV log* JV each; each group is assigned to one processor 
which copies the lines in that group into T. B is similarly copied into T. This takes 
2 log JV log* JV time. 

2. s is found using Cole’s selection algorithm which takes 0(logJVlog* JV) time. 

3. Now two sets should be formed, one with lines of slope greater than s and the other 
with lines of slope smaller than 3. The first set is formed as follows : 

(a) T is divided into groups of size logJVlog*JV each and each group is 

assigned to a processor. The processor counts the number of lines in group 
t with slope greater than s and writes that count in location i in an array, say 
Count. This takes log JV log* JV time. , , 

(b) The prefix sum of the entries in Count is computed. The result is stored back 

in Count. Count has entries and the prefix sum of these can be 

computed in Iog( iogjvXg* 7 r) < log JV time using isilTToinT processors, 

(c) Now processor i scans its group again and selects lines with slope greater than s 
and moves them into an array (meant for storing these lines), using the entry in 
location z — 1 of Count as index. 

The second set is formed similarly. The time taken by these operations is 

0(logJVlog*JV). 
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4. Once the two sets are formed, lines - one from each set - can be paired and their point 
of intersection computed in log N log* N time. 

Hence the total time taken by step 1 of the pruning phase is 0(logiVlog*i\r). 

We have described the ham-sandwich cut algorithm as if different number of pro- 
cessors are available for the different stages of the algorithm. In the implementation 
described, stage i of the algorithm would use processors and take 

log(/?’“'iV)log*(/3‘~’iV) time. To implement the algorithm using jop jy processors, 
we have to reschedule certain stages of the algorithm. Stage i is implemented as fol- 
lows : The control processor compares a = with h,- = ' 

If a > 6f, then stage t is not rescheduled - so it runs in log(/?‘“'iV)iog*(;0‘“^iV) time 
using bi processors. If a < bi, then stage t is rescheduled using a processors. It is 
easy to verify that the time taken by the rescheduled stage i is 0(/?‘”^ log^ JV log* N -I- 
log{nuTnber-of 4 >rocessors)). Since the number ^of -processors = a is less than bi, we have 
log(a) < log(6i) < log(/?*“^iV) < log()9’“^iV)log*(;9’~^JV). In either case, the time re- 
quired by stage i is 0(/?‘~'log^lVlog*iV H- log(/3‘~^iV)log*(j3’~^lV)). It follows that the 
ham-sandwich cut algorithm can be implemented in 0(log^ JVlog* JV) time. 



Appendix C 


The Pruning Step of the 
Centerpoint Algorithm in More 
Detail 


In this appendix, we explain the pruning step of the parallel centerpoint algorithm presented 
in chapter 3 in more detail. The symbols and notations used in this appendix are the same 
ones introduced in section 3.4 of chapter 3. 

We denote the four sets involved in pruning Slui Sldj Srd and Sru by A, B, C and 
V respectively. Sets A, B and V contain - 1 points each. We assume that C contains 
exactly the same number of points. We make two observations below. We will not give 
formal proofs of these observations but only verify them by considering the various possible 
arrangements of the four planes L,U^ D and R. 

Observation 1 : Sets A and C do not have any common point of P. Similarly sets B and V 
are disjoint. We verify this observation by noting that no point of P belongs to all the four 
planes L, U, D and R in any arrangement of them. 

Observation 2 : There is a set disjoint from the other three sets. 

The pruning step in the parallel algorithm of chapter 3 is implemented as follows : 

1. Find four bit strings S/i, s/j, sc and sp corresponding to sets A, B, C and V respec- 
tively. In each bit string, bit i is set if point i belongs to that set. 
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Figure C.l: Eliminating triplets of points in case 2 

2. Find the following four intersections of these bit strings - sab (intersection of and 
■SB ), SBC') scD and SB A- If follows from the above two observations that following 
three cases can arise : 

• Case 2: Two of these intersections are non-null. 

• Case 1: One of these intersections is non- null. 

• Case 0: All the four sets are disjoint. 



Figure C.2: Eliminating triplets of points in case 1 
3. Case 2: Let o = jA n B|, 6 = |B 0 C| and X = \V\. Observe that X -a>h { which 



implies that X — b> o). Let a<b. Eliminate triplets of points as shown in figure C.l 
until all a points in ^ get exhausted. This now reduces to case 1. 

Case i: Let c = jC 0 2>I and X = j.4|. Eliminate triplets of points as shown in 
figure C.2 until C D P gets exhausted. This now reduces to case 0. 

Case 0: Four points, one from each of the sets A., B, C and V, are chosen and 
joined in that order. If they form a convex quadrilateral, they are replaced by the 
point of intersection of the diagonals of the quadrilateral. If they form a non-convex 
quadrilateral, the interior point is retained and other three are discarded. 

It is clear that the above pruning routine can be implemented using processors 

in time 0(log^ nlog* n). It also ensures that a fixed fraction of the points of P is discarded 
in each pruning phase. 

We now verify the two observations made before by considering the various possible 
arrangements of the four half-planes L, 17, D and R. We denote the lines defining these 
half-planes by I, u, d and r respectively. Without loss of generality, we can assume that 
the line I is vertical. Lines u and d will be nonvertical. The point of intersection of / and 
u is denoted by pi^. It is clear that the half-plane D can not contain the point i.e. d 
inlc'rsoctB I below the point pj„. Similarly r intersects u at a point p^u lying to the right of 
1 . 

Three cases arise depending on the slopes of u and d : 

• Case 1 : u and d are parallel. 

• Case 2 ; « and d intersect to the left of /. 

• Case 3 : u and d intersect to the right of 1. 

In each of the above cases, we consider the different positions which r can take and 
verify the observations in each of the resulting arrangements. Let a, 0 < a < x, denote the 
angle between tt and r, measured as r rotates about the point pm in the counterclockwise 
direction. 

Figure C.3 shows a few of the possible arrangements in case 1 as a takes different values. 
Since U f) D = <t>, observation 1 is satisfied. In the first three arrangements, A = Slu is 
disjoint from the remaining three sets and in the last one, B is disjoint from the remaining 
three sets. 







Figure C.3: Different arrangements when u and d are parallel 

Figure C.4 some of the possible arrangements in case 2. In this case, 0 < a < /? where 
/? is the angle shown in figure C.4(iii). Since a < /? (because (P — L)f\ (P — U) 0 (P — D) 
contains ^ points and R is required to contain ^ points oi P — U), R does not contain any 
point of I- n t/ n D (shown shaded in figure C.4(iii)), verifying observation 1. Observation 
2 can be easily verified. 




(iii) 


Figure C.4: Different arrangements when u and d intersect to the left of I 

In case 3 where u and d intersect to the right of I, Lf\Ur\D = and hence observation 1 
is true. The second observation can be verified by considering the different orientationi 
possible for r. 



